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Structured  Design  for  an  LPC 
Array  Processor 

1.  INTRODUCTION 

The  research  activities  of  the  several  ARPA  contractors  in  the  NSC  network 
speech  conpresslon  progran  have  reached  a  stage  of  success  in  LPC  algorithm 
development  to  merit  real  consideration  of  hardware  economy  and  efficiency. 

To  be  sure,  there  are  many  other  contributors  to  the  body  of  research 
supporting  the  LPC  technique;  and  most  all  of  it  taken  together  forms  a 
successful  whole  that  needs  to  be  supported  by  better  hardware  to  facilitate 
its  expression.  To  this  end,  ARPA  has  funded  a  modest  effort  at  CHI  to  design 
an  array  processor  that  is  particularly  well  suited  to  perform  the  important 
steps  of  the  family  of  LPC  algorithms  and  at  the  same  time  to  be  amenable  to 
LSI  Implementations.  What  follows  is  our  first  quarterly  report  on  this 
design  effort. 

A.  Summary  of  Contract  Intent 

The  design  team  at  CHI  has  been  engaged  in  developing  prograimuble  special 
purpose  computers  for  a  number  of  years.  Much  of  this  work  has  been  done  for 
ARPA,  principally  in  the  direction  of  on-line  research  systems  for  signal 
processing  in  the  acoustical  frequency  domain. 

Our  past  work  has  a  certain  architectural  stamp  or  style  that  results  from 
the  way  we  have  approached  the  design  process,  the  blending  of  very  different 
technical  backgrounds,  and  the  techniques  we  have  we  settled  on  for  solving 
internal  structure  problems  that  seem  to  have  resulted  in  surprising  effi¬ 
ciency  and  reliability  of  our  devices.  In  our  past  efforts  we  have  generally 
been  confronted  with  an  applicational  design  goal  and  with  a  fixed 
technology*  in  terms  of  which  the  design  must  be  realised.  With  the  rapid 
change  of  technology,  we  have  found  ourselves  designing  the  same  internal 
structures  in  different  ways  to  take  advantage  of  the  new  technology.  This 
has  led  us  to  ask  how  may  w«  represent  our  designs  so  they  can  be  independent 
of  the  technology  used  to  realise  them.  The  task  of  designing  an  LPC  array 

*ln  this  instance,  what  we  mean  by  a  technology  is  a  set  of  available  digital 
circuits  and  associated  connectors  and  construction  techniques. 
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processor  chat  can  be  prototyped  from  existing  components  and  then  later 
realized  as  a  single  (or  a  few)  LSI  chlpa,  brings  this  technology  Invariance 
question  forward  as  a  primary  issue  to  be  dealt  with. 

As  a  result  of  this,  Che  intent  of  our  work  on  contract  MDA903-78-C-0313 
is  not  only  to  interpret  the  basic  operations  occurring  in  various  LPC  analysis 
and  synthesis  algorithms  and  to  design  an  array  processor  chat  efficiently 
performs  them,  but  also  to  carry  out  the  research  and  development  required  to 
produce  a  design  system  that  will  make  it  possible  to  produce  technology- 
invariant  designs.  We  are  not  suggesting  that  this  can  be  done  "untouched  by 
human  hands,"  for  this  is  not  in  the  nature  of  architecture;  but  rather  that 
the  human  input  to  the  design  system  will  be  primarily  limited  to  providing 
creative  organizational  expression  while  the  computer  part  of  the  system 
responds  to  this  input  by  generating  a  number  of  important  design  outputs  such 
as  a  logic  simulator  that  realizes  an  equivalent  design,  and  after  geometrical 
specification  and  primitive  module  specification,  a  connector  sort,  a  logic 
sort,  and  ultimately,  diagnostic  software.  Of  course,  all  of  this  points  in 
the  direction  of  how  to  use  a  computer  system  to  help  us  establish  how 
successful  a  design  is  before  we  build  it  and  how  to  prove  it  works  correctly 
after  it's  built. 


B.  Current  Stage  of  Development 

By  the  time  contract  work  began  we  had  come  to  understand  in  qualitative 
terms  the  two  principal  issues  involved: 

1.  What  do  the  LPC  equations  indicate  about  the  choice  of  array 
processor  architecture  and  how  does  this  relate  to  earlier 
array  processors. 

2.  What  are  the  important  software  objects  that  could  be  expected 
to  work  together  to  achieve  a  structured  design  system  that 
would  help  us  solve  the  reconveyance  problem. 

Consequently,  we  were  able  to  allocate  tasks  that  could  proceed  in 
parallel.  These  turned  out  to  be  of  two  kinds  —  those  that  were  clearly 
understood  and  would  play  a  definite  roll  in  the  work  to  be  accomplished  and 
those  that  would  gather  information  that  might  help  us  find  out  how  to  deal 
with  the  intangibles  of  our  problem.  In  the  latter  category  we  had  to: 
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a.  Compare  the  equations  arising  in  representative  LPC  algorithms 
and  establish  performance  criteria  at  the  array  process  level. 

b.  Review  the  steps  of  our  prior  design  efforts  and  the  associated 
documentation  and  seek  a  restructuring  of  both  these  steps  and 

the  documentation  to  make  the  whole  of  It  computationally  derivable 
from  a  structured  design  document. 

c.  Study  potential  data  structures  and  associated  control  functions 
that  could  lead  to  a  logical  simulator  that  Is  computationally 
derivable  from  a  structured  design  document. 

At  present,  we  can  report  on  successful  accomplishment  of  these  tasks 
and  are  well  along  In  Implementing  their  consequences.  More  specifically: 

1.  Ue  have  a  design  for  an  LPC  array  processor  ready  for  testing. 

2.  A  user  input  language  for  specifying  a  logical  design  has  been 
devised  and  a  program  has  been  written  —  called  the  logic 
assembler  —  that  deduces  a  data  structure  which  represents 
the  logical  design  and,  as  such,  provides  the  basis  for  logic 
simulation. 

3.  We  have  established  the  concept  of  module  substitution  within  the 
logic  simulator  and  have  designed  a  module  analyzer  and  Its 
Interface  to  the  ARPA  signal  processing  system.  By  specifying 

to  the  logic  asseodbler  that  a  certain  module  Is  to  be  replaced 
during  simulation  b7  actual  hardware  module, we  can  cause  the 
logic  simulator  to  control  the  module  analyzer  and  capture  the 
hardware  moduleh  outputs  for  further  use  In  the  simulation.  What 
this  means  Is  that  with  the  device  represented  by  software,  we 
can  test  a  given  hardware  module  "In  situ"  for  the  whole  device 
without  having  the  rest  of  the  device  in  hand. 

4.  We  have  restructured  the  nature  of  our  hardware  documentation  to 
achieve  a  form  that  can  be  automatically  generated  and  yet  be  of 
similar  use  to  engineers  In  both  checkout  and  maintenance. 

Indeed,  an  engineer  has  to  become  familiar  with  Its  use,  but  Its 
simplicity  appears  to  make  this  no  great  difficulty. 
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5.  Prograns  that  produce  design  outputs  of  this  selected  docussintatlon 
fomat  are  being  written  and  some  Intermediate  results  are  ready 
for  use. 

6.  We  have  begun  specifying  the  test  programs  for  the  LFC  array 
processor. 

The  work  remaining  to  be  done  on  the  contract  consists  of  building  and 
checking  out  the  prototype,  finishing  the  software  for  the  structured  design 
system  and  demonstrating  the  performance  of  the  prototype  on  representative 
LPC  algorithms. 

In  discussions  leading  up  to  this  contract  work  It  was  anticipated  that 
after  the  array  processor  design  was  appropriately  in  hand,  we  would  be  In  a 
position  to  cooperate  with  some  selected  LSI  house  as  an  aid  In  the  technology 
transfer.  Our  estimate  Is  that  by  the  end  of  the  second  quarter  we  will  be 
In  a  position  to  do  this. 
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2.  A  PRELIMINARY  DESIGN  FOR  AN  LPC  ARRAY  PROCESSOR 

In  J.  Hakhoul's  paper  "Stable  and  Efficient  Lattice  Methods  for  Linear 
Prediction,"  the  best  known  methods  of  LPC  analysis  are  brought  together  In 
a  simple  common  framework.  We  choose  this  as  our  principal  reference,  partly 
because  of  Its  clarity,  but  mostly  because  It  reveals  a  wide  range  of  computa¬ 
tional  complexities  engaged  In  the  LPC  techniques.  Bolling  these  down  to 
their  essences  results  In  only  a  few  prime  considerations,  nevertheless 
arranging  for  their  adequate  Implementation  brings  us  as  close  to  a  general 
purpose  array  processor  as  we  dare  risk.  The  point  of  concern  is  this  —  when 
a  family  of  tasks  are  to  be  carried  out  in  a  single  special  purpose  processor, 
they  must  possess  some  underlying  simplicity  that  results  In  design  advantage 
or  we  are  forced  to  a  general  purpose  design  and  the  power  of  our  approach  Is 
lost.  Under  the  latter  circumstances  we  must  restrict  the  family  before  we 
can  achieve  a  successful  design. 

The  Input  to  the  analysis  process  comes  from  an  A/D  converter  which  may 
as  well  be  taken  In  Integer  format.  The  dynamic  range  of  voice  signals  Is 
such  that  at  least  8  blts/sample  are  required  to  perform  any  sensible  analysis 
and  10  bits  are  as  low  as  most  implementers  are  willing  to  consider.  In  fact, 
most  systems  of  reasonable  quality  either  use  12-blt  samples  or  incorporate 
some  form  of  secondary  gain  ranging.  In  what  follows  we  will  assume  that 
the  Input  data  Is  12  blts/sample  and  In  Integer  format.  As  to  conversion 
rates,  the  NSC  conferencing  system  used  150  ys/sample  corresponding  to  3.2  Khz 
Nyqulst  frequency  which  Is  marginally  acceptable  for  male  voices  but  unfor¬ 
tunately  low  for  some  female  users  with  very  small  vocal  tracts.  A  sample 
Interval  of  50  ys  guarantees  good  sibilant  data  for  practically  all  users, 
so  we  assume  that  the  sample  Interval  AT  is  In  the  range  50  ys  ^  AT  £  150  ys. 

In  order  to  set  up  an  array  oriented  process,  we  pick  a  data  block  size  long 
enough  to  get  statistical  benefit  In  estimating  the  vocal  tract  area  function 
and  short  enough  to  get  good  resolution  of  the  change  of  this  shape  with  time. 
Accordingly,  we  can  expect  analysis  block  sizes  to  be  In  the  neighborhood  of 
100  samples  with  corresponding  tliae  bases  In  the  range  of  5  to  15  milliseconds. 
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In  keeping  with  Makhoul'a  notation,  let  f  and  b  represent  forward  and 
backward  residuals  and  K  denote  reflection  coefficient.  Since  -1  <  K  <  1, 
the  direct  lattice  filter  equations: 


Vi<">  ■ 


requires  us  to  perform: 


1) 


(Fraction  x  Integer) +  Integer  -►  Integer  2) 

as  an  array  process.  The  Inverse  filter: 


Vi<“>  ■ 


3) 


Incorporates  feedback  and  for  small  amplitudes  engenders  a  limit  cycle 
pathology.  To  handle  these  properly,  we  propose  to  Incorporate  an  Integer  by 
fraction  multiplication  with  rounding  end  to  overlap  the  add  (or  subtract)  opera¬ 
tion.  Additionally,  to  guard  against  overflow,  we  will  arrange  that  only  11 
of  the  12  bits  of  Integer  data  will  be  significant  going  into  the  above  equa¬ 
tions.  Also,  as  a  means  to  stay  away  from  the  limit  cycle  pathology,  we  will 
sense  the  output  size  Information  on  the  fly.  That  Is,  when  the  input  data 
Is  too  large,  we  will  use  a  bullt-ln  ri^t  shift  In  the  array  process  and 
when  the  output  data  Is  too  small,  we  will  insert  an  auxiliary  left  shift 
array  process. 

a)  If  the  12th  bit  of  the  Input  arrays  Is  significant,  right  shift  on 
read. 


b)  If  the  output  array  has  no  more  than  8  significant  bits,  left  shift 
It  3  places  before  proceeding. 


In  the  computation  of  Inner  products  of  Integer  arrays  like: 

f*f,  ^*b_j^  where  b  j^(n)  -  b(n-l) 

we  need  to  use  Integer-by- Integer  multiplication,  and  since  our  arrays  have 
about  100  elements,  we  need  an  Integer  multiply  overlapped  with  addition  into 
an  extended  accumulator. 

The  noncorrective  methods  require  much  more  precision  than  the  corrective 
ones;  for  this  purpose,  we  provide  programmed  floating  point  and  double 
precision  operations.  We  are  all  aware  of  the  much  lower  operation  count  of 
these  methods  compared  to  the  reduction  of  residual  corrective  methods,  but 
to  be  fair,  we  should  count  them  In  at  least  the  double  precision  sense.  In 
many  cases,  not  having  the  residuals  as  an  output  of  the  computations  creates 
a  need  for  extra  processes  In  the  overall  algorithm,  thereby  further  reducing 
their  apparent  advantage. 

In  this  section  we  present  our  current  results  In  the  development  of  an 
array  processor  tailored  to  the  requirements  of  real  time  LPC  analysis  and 
synthesis.  In  keeping  with  the  contract  Intent  discussed  In  section  1,  we 
will  later  express  the  design  In  such  a  manner  that  It  can  be  equivalently 
realized  In  different  technologies.  To  do  this  we  will  use  the  structured 
design  approach  presented  In  section  3  with  our  overall  design  goals  esta*- 
bllshed  as  a  creative  Interpretation  of  the  requirements  set  forth  through 
the  review  of  typical  LPC  algorithms. 

The  highest  level  top-down  subdivision  of  the  LPCAP  splits  It  into  four 
submodules:  AFU,  CPU,  PS,  and  10. 

Their  overall  functions  can  be  described  as  follows: 

1.  The  APU  module  carries  out  all  the  arithmetic  operations  on  data  as 
specified  by  the  right-hand  part  of  the  instruction  lines  (INSTRR). 

Its  registers  provide  source  data  for  output  to  the  10  module. 

Certain  characteristic  bits  provide  status  Information  to  the  CPU  that 
may  be  used  In  CPU  conditionals. 

2.  The  CPU  module  performs  program  sequencing,  data  addressing.  Indexing, 
APU  and  10  control.  Its  operations  are  specified  by  the  left-hand 
part  of  the  Instruction  lines  (INSTRL).  In  run  mode  these  Instruction 
lines  are  supplied  by  some  of  the  outputs  of  PS.  When  a  HOST  Is 
present  and  when  the  CPU  Is  halted,  these  Instruction  lines  are  supplied 
by  the  10  module,  thus  providing  the  HOST  with  single  step  control  of 
the  CPU. 
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3.  The  PS  module  Is  a  program  source  module  addressed  by  the  CPU.  When 
PS  is  a  RAM  (or  contains  a  subset  that  Is  a  RAM)  it  can  be  loaded  by 
a  sequence  of  outputs  from  the  10  module  under  control  of  the  HOST. 

4.  The  10  module  provides  an  Interface  between  the  LPCAP  and  the  external 
world.  Aside  from  vrhatever  complexities  are  Introduced  by  the  pre¬ 
sence  of  a  HOST  machine.  It  must  manage  voice  A/D  and  D/A  conversion 
and  receive  and  transmit  voice  parameters  to  some  communication  link. 

The  main  substance  of  this  section  Is  the  treatment  of  each  of  these 
modules  In  turn. 


COMfUNICATION 


Figure  2.1  Block  diagram  showing  the  relationship  of  the 
four  principal  s\d>modules  of  the  LPCAP. 
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A.  The  APU  Module 


The  arithmetic  requirements  on  the  APU  module  Imposed  by  the  form  of  the 
equations  used  In  LPC  analysis  and  synthesis  may  be  summarized  as  follows: 

1.  The  dynamic  range  of  voice  signals  are  best  represented  by  12-blt 
samples . 

2.  Precise  Inner  products  of  sample  vectors  with  at  least  128  components 
are  required  for  the  noncorrective  methods  of  computing  reflection 
coefficients. 

3.  Fraction  by  Integer  multiplication  Is  required  for  forming  linear 
combinations  of  signals. 

4.  Both  down  scaling  to  guard  against  overflow  and  upscaling  to  guard 
against  limit  cycle  pathologies  of  digital  filtering  are  required. 

5.  The  same  Inputs  must  apply  to  both  sides  of  the  multiplier  so  that 
the  sum  of  squares  can  be  computed  without  copying  a  vector. 

6.  The  quotients  count  In  LPC  algorithms  Is  so  low  that  a  programmed 
divide  will  suffice. 

Since  the  LPC  algorithms  have  a  very  concentrated  requirement  for  multi¬ 
plies  and  since  most  of  these  are  associated  with  adds,  we  should  focus  our 
attention  on  multipliers  that  have  configurations  with  both  Integer-by- Integer 
full  precision  and  fractlon-by- Integer  rounded  outputs.  Such  multipliers  are 
already  available  with  execution  times  In  the  neighborhood  of  microprocessor 
Instruction  times  so  the  architectural  considerations  In  loading  and  unloading 
data  Is  of  paramount  Importance.  A  powerful  Illustration  of  how  to  do  this 
right  and  the  unfortunate  consequences  of  not  doing  It  right  Is  provided  by 
comparing  multlply-add  throughputs  of  the  TRW  chips  MP12AJ  and  MP16AJ.  The 
first  can  be  used  with  no  loss  time  In  a  looped  multlply-add  and  the  latter 
has  a  loss  essentially  matching  Its  execution  time.  Our  point  Is  not  to 
flamboyantly  snipe  at  the  MP16AJ  but  rather  to  emphasize  the  value  of  the 
architecture  of  the  MP12AJ.  Even  though  LSI  technology  will  soon  support  the 
complete  LPCAP  on  one  chip,  the  Internal  multiply  submcdule  should  have  the 
architecture  of  the  MP12AJ. 


9 


/ 


with  the  architecture  of  the  multiply  nailed  down,  we  can  extend  the 
structure  to  account  for  the  requirements  listed  above. 

a.  Since  the  forward  and  backward  waves  are  the  primary  Inputs  to  the 
multiplier  In  a  Burg-type  analysis,  and  since  the  sum  of  squares  of 
each  Is  to  be  computed,  we  must  provide  a  2-to-l  multiplexer  for 
each  of  the  multiplier  Inputs  (cf  figure  2.2);  then  5)  Is  satisfied. 

b.  In  order  to  provide  precise  inner  products  of  12-bit  data  in  128 
word  blocks,  we  use  the  full  integer  product  with  sign  extension  to 
32  bits  and  use  this  as  the  A  Inputs  to  a  32-blt  adder,  with  a 
32-blt  accumulator  attached  to  the  B  side;  this  takes  care  of  2) . 

c.  Since  the  reflection  coefficients  are  fractions  between  -1  and  +1, 
when  they  multiply  a  vector  with  integer  components  to  give  a  vector 
of  the  same  type  we  must  use  the  fraction-by- Integer  product  with 
rounding.  To  add  this  product  to  another  such  vector,  as  In  the 
lattice  filter  computation,  we  also  need  a  12-blt  accumulator. 

d.  The  scaling  problems  Intrinsic  to  LPC  analysis  and  synthesis  are  not 
very  severe,  yet  we  must  have  some  means  for  handling  them.  One 
simple  way  to  do  this  Is  to  maintain  two  flag  bits  under  program 
control,  call  them  Big  and  Small.  Let  Big  selectively  monitor  the 
data  written  In  pad  and  express  the  or  of  the  exclusive  ors  of  the 
left-hand  two  bits. of  each  array  value.  Then  Big  Is  a  1  If  any 
array  datum  Is  of  full  scale.  Now  let  Small  monitor  the  left-hand 

4  bits  In  the  same  manner.  Then  Small  Is  a  0  If  no  array  datum  has 
more  than  8  significant  bits.  The  capability  of  clearing  and  testing 
these  flag  bits  and  maintaining  a  scale  word  In  pad  to  account  for 
change  of  scale,  provides  a  software  overview  of  the  scaling  relations. 
We  Include  downscaling  hardware  on  the  output  from  pad  and  will  use 
the  multiply  hardware  In  a  special  pass  to  upscale  the  data  when 
required. 

e.  One  of  the  aspects  of  the  special  forms  of  the  equations  relating 
the  forward  and  backward  waves  coupled  with  the  direct  association 
of  adders  and  accumulators  la  that  we  sometimes  need  to  exchange 
outputs.  Perhaps  a  stronger  way  to  say  this  is  that  we  must  be  able 
to  move  data  between  scratch  pad  arrays.  To  satisfy  this  desire,  we 
Include  a  pair  of  2-to-l  output  multiplexers. 


10 


/ 


V 


A 


f.  The  final  step  In  this  overview  of  the  AFU  consists  of  providing 
means  for  asseiid)llng  and  dlsaaseid>llng  byte-oriented  data  sets. 

To  this  end  we  have  selected  two-way-ln  registers  for  the  accumulators 
with  Interconnections  to  an  8-blt  accumulator  accepting  an  external 
Input . 

In  figure  2.2  below,  we  give  a  module  schematic  for  the  AFU  together  with 
a  decoding  chart  with  the  module  operation  code  definitions. 

B.  The  CFU  Module 

The  AFU  module  discussed  above  carries  with  It  all  of  the  significance 
of  design  relating  to  LFC  specializations.  For  the  CFU  we  are  after  the 
simplest  sufficiently  rapid  control  to  get  the  Job  done.  Whereas  a  12-blt 
structure  was  used  In  the  AFU  module,  an  8-blt  structure  Is  the  proper  choice 
for  the  CPU  module.  The  two  scratch  pads  (X  and  Y)  of  the  AFU  need  only  be 
256  words  long  to  provide  adequate  data  buffering,  so  corresponding  to  these 
the  CPU  contains  two  8-blt  address  registers  (XA  and  YA).  Present  estimated 
program  size  Indicates  that  with  an  Instruction  page  size  of  256  Instructions, 
we  can  expect  the  subroutines  for  analysis  and  synthesis  to  each  fit  within 
a  page.  For  prototyping  purposes  we  will  use  a  4096  instruction  program 
source  (the  PS  module  discussed  In  C)  with  48-blt  instructions  and  256  per 
page.  In  a  real  device  this  will,  of  course,  be  limited  to  the  size  deter¬ 
mined  by  programming  the  algorithms. 

The  CPU  can  now  be  summarized  as  follows.  Starting  with  three  address 
counters  (PSA,  XA,  YA)  of  8  bits  and  an  8-blt  tally  register  for  array  loop 
control  we  bus  these  together  with  8  bits  from  program  source  (the  PS  value 
field).  Branching  according  to  the  tally  register,  the  sign  bit  of  the  AFU 
adder  and  10  status  bits  are  then  provided. 

In  setting  up  an  array  process,  the  bus  connection  between  the  PS  value 
field  and  the  Internal  registers  of  the  CPU  is  used  to  Initialize  them  one  at 
a  time.  In  the  array  process,  all  of  the  registers  can  be  simultaneously 
controlled  In  each  micro  Instruction.  In  order  to  provide  control  for  the 
10  module,  the  bus  mentioned  above  Is  extended  to  the  10  module.  In  the 
bus  control  fields  (TRM  and  BBC)  shown  In  figure  2.4  below,  we  can  address  a 
device  and  then  Input  or  output  as  required. 
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Figure  2.2  AFU  Hodul*  Scheaatic 


C.  The  PS  Module 


The  prograa  source  module  hss  been  designed  to  allow  full  convenience 
in  iaq>leBentlng  trial  algorithms  that  will  need  to  assess  the  real  time 
performance  of  the  LPC  AP.  The  design  is  elementary  and  merits  no  special 
discussion. 

/ 

D.  The  10  Module 

The  design  of  the  10  module  has  been  specialized  to  provide  an  inter¬ 
face  between  the  LPC  AP  and  the  Serial  Three  signal  processing  research 
system.  Our  approach  here  has  been  to  take  advantage  of  the  module  analyzer 
—  one  of  the  key  elements  in  the  structured  design  system  discussed  in 
section  3.  The  module  analyzer  is  directly  Interfaced  to  the  host  machine 
and  can  be  used  as  a  prograflatable  test  fixture  to  check  out  the  individual 
modules  of  the  LPC  AP  as  well  as  the  entire  device. 

The  10  module  provides  a  means  for  loading  programs  into  PS,  trans¬ 
ferring  sample  data  blocks  and  parameter  blocks  to  and  from  the  Host. 
Additionally,  it  provides  the  logic  of  a  pseudo  panel  that  permits  the  Host 
to  single  step  operations  of  the  LPC  AP. 
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3.  THE  STRUCTURED  DESIGN  TECHNIQUE 


Logical  design  can  be  thought  of  as  a  creative  process  In  which  larger 
modules  are  defined  as  cpiid>lnatlons  of  smaller,  simpler  ones  In  such  a  way  as 
to  fulfill  basic  design  goals. 

The  approach  can  be  "bottom  up."  In  this  case  the  designer  visualizes 
combinations  of  available  parts,  and  combinations  of  these,  always  working 
towards  accomplishing  the  desired  goals. 

An  alternative  Is  a  "top  down"  approach  in  which  the  design  is  subdivided 
Into  sections  which.  If  Implementable,  would  achieve  the  basic  goals.  These 
sections  are  again  decomposed,  and  so  on,  until  the  resulting  blocks  are 
seen  to  be  bulldable  from  available  parts.  The  subdivisions  represent  Inter¬ 
mediate  design  targets. 

A  third  approach  works  from  both  directions  simultaneously,  hoping  to 
meet  In  the  middle.  At  that  point,  sections  which  are  known  to  accomplish 
the  design  goals  are  seen  to  be  realizable  as  combinations  of  coid>lnations  of 
available  hardware. 

In  any  of  these  approaches  the  final  design  can  be  conceived  as  a  single 
entity  or  expressed  In  modular  terms.  However,  modular  design  simplifies 
checkout,  allows  for  consideration  of  more  design  alternatives,  and  facili¬ 
tates  making  changes. 

A  structured  design  Is  a  tree-like  hierarchy  having  the 
following  characteristics. 

1.  For  each  module  In  the  structure,  the  outputs  are  uniquely  determined 
by  the  Inputs,  the  controls  and  the  status  of  memory. 

2.  Each  module  In  the  hierarchy  Is  either  primitive  (not  broken  down 
further)  or  composite  (a  combination  of  lower  level  modules  called 
the  submodules  of  the  module) . 

3.  The  definition  of  a  composite  module  involves  only  its  submodules. 

No  reference  to  submodules  of  the  submodules  Is  allowed. 

This  means  that  from  the  point  of  view  of  a  module,  there  is  only  one 
level  below  It.  All  strata  beneath  are  invisible  to  the  module,  and  each 
module  acts  as  a  primitive  relative  to  higher  level  design. 
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In  a  hardware  realization  of  the  design,  the  primitives  might  be  chips, 
and  lower  level.  Intermediate  modules  might  be  groups  of  chips  wired  together. 
A  higher  level  Intermediate  module  might  be  realized  as  a  board  or  a  group 
of  boards  connected  by  cables. 

A.  Reconveyance  of  Design 

Assume  that  a  design  has  been  created  which  accomplishes  the  original 
goals.  Of  course,  many  alternative  designs  could  be  produced  fulfilling  the 
same  criteria.  In  particular,  one  of  the  Intermediate  modules  (along  with 
Its  substructure)  could  be  replaced  by  a  different  coiri>lnatlon  of  submodules 
(along  with  their  substructures)  which  performed  the  ssstt  function.  If  all 
of  the  relationships  among  other  elements  of  the  hierarchy  were  preserved, 
the  resultant  design  would  fulfill  the  original  goals.  Similarly,  substitu¬ 
tion  for  an  Intermediate  module  In  the  new  design  would  lead  to  an  additional 
acceptable  design. 

Thus,  one  design  can  be  re conveyed  by  a  whole  family  of  equivalent  designs. 
The  highest  level  subdivision  of  the  original  design  Is  of  critical  Importance 
since  all  members  of  Che  family  Inherit  the  top  structure. 

If  an  Intermediate  module  has  been  thoroughly  checked  out,  either  by 
theoretical  simulation  or  by  prototyping  and  hardware  checkout,  the  design 
can  be  reconveyed  Into  one  In  which  the  module  in  question  Is  a  primitive. 

The  design  Is  slnqillfled  by  removing  the  substructure  of  the  module  from  the 
hierarchy. 

This  Is  of  practical  as  well  as  theoretical  interest.  For,  at  any  stage, 
from  design  through  prototyping  or  even  production,  technological  advances 
can  be  easily  Incorporated  Into  Che  existing  design.  If  a  function  which  was 
previously  computed  In  a  cosqioslce  module  Is  found  to  be  perforauible  by  a 
new  device  (for  example,  a  chip)  the  original  design  can  be  reconveyed  Into 
one  where  that  module  Is  a  primitive.  It  would  not  be  necessary  to  redesign 
Che  whole  system,  which  would  be  required  If  there  were  no  modularity. 
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B.  Computer  Involvement  In  the  Design  Process 

A  computer  cen  be  brought  into  the  design  process  in  several  ways. 

Elements  of  a  computer  aided  design  system  are  given  below. 

Given  an  inventory  of  primitives  and  a  language  suitable  for  describing 
composite  modules,  a  logic  assembler  can  produce  a  data  structure  encapsulating 
the  design.  From  this  structure,  appropriate  programs  can  generate  the  tedious 
documentation  necessary  to  implement  the  design  —  for  example,  connector  sort 
and  wire  list.  This  frees  the  designer  to  concentrate  on  more  creative  aspects 
such  as  alternative  designs  of  intermediate  targets. 

An  editor  which  enables  changes  to  be  made  in  documents  defining  the 
design  facilitates  consideration  of  alternatives. 

A  logical  simulator  can  be  used  to  calculate  the  response  of  any  module 
in  the  system  to  particular  seta  of  Inputs  and  controls  (not,  of  course,  in 
real  time) .  The  simulator  works  on  the  data  structure  set  up  by  the  logic 
assembler.  At  each  stage,  the  Inputs  and  controls  are  passed  on  to  submodules 
until  they  reach  a  primitive  module.  Then  a  subroutine  pointed  to  in  the 
prlmtlve's  description  is  invoked.  The  outputs  thus  calculated  are  passed 
back  to  the  higher  level  module.  Eventually,  when  the  actions  of  all  8ub~ 
modules  of  a  module  have  been  simulated,  the  outputs  of  that  module  are 
implicitly  determined.  In  this  way,  the  design  can  be  exercised,  while  still 
in  theoretical  form,  to  make  sure  that  each  module  conforms  to  design  goals. 

When  a  module  has  been  realized  by  engineering  design  and  prototyping, 
its  adequcny  as  part  of  the  overall  design  can  be  tested  by  Integrating 
the  action  of  a  module  analyzer  with  the  simulator.  The  hardware  realization 
of  the  module  is  plugged  into  the  analyzer  and  switches  are  set  to  indicate 
which  connector  elements  represent  controls  inputs  and  outputs. 

At  the  stage  in  the  simulation  when  execution  of  the  module  is  required. 
Inputs  and  controls  are  supplied  to  the  external  device  through  the  module 
analyzer.  The  resulting  outputs  produced  by  the  device  become  the  values 
which  are  used  at  the  next  stage  of  the  simulation. 
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C.  A  Mathematical  Model  for  Computer  Design 


In  order  to  Implement  a  computer  aided,  design  facility  It  Is  necessary 
to  have  an  underlying  mathematical  model  of  a  computer  system  and  the  process 
by  which  the  system  Is  constructed  from  components.  This  'theoretical' 
computer,  as  well  as  Its  components, will  be  called  a  logical  module.  The 
model  should  be  such  that  Its  features  and  functions  can  be  translated  Into 
a  computer  acceptable  language. 

The  purpose  of  this  section  Is  to  provide  basic  definitions  and  clarify 
the  synthesis  process  by  revealing  rules  of  combination  which  result  in 
logical  modules.  We  must  also  develop  a  language  that  provides  precise 
expression  of  one  logical  module  In  terms  of  others.  These  concerns  motivate 
the  definitions  which  follow. 

The  definitions  are  In  terms  of  vector  spaces,  so  as  a  preliminary  step, 
we  Indicate  notations  which  will  be  used  elsewhere  In  this  section. 

Let  X  and  Y  be  finite  dimensional,  vector  spaces  over  finite  fields. 

1.  X  X  Y,  the  cross-product  of  X  and  Y,  Is  the  set  of  pairs  of  vectors 
(x,  y)  where  x  e  X  and  y  e  Y. 

2.  If  X  and  Y  are  over  the  same  field,  then  X©Y,  the  direct  sum  of  X 
and  Y,  Is  the  set  of  vectors  x|y.  If 

X  •  (xi,  ...  ,  X  )  ex  and  y  -  (yi,  ...  ,  y  )  eY 
n  B 

then  x|y  has  coordinate  expression 

xly  “  (xif  •••  f  yi,  ,  y  ) 

n  ID 

3.  If  X  and  Y  are  of  the  same  dimension,  n,  and  over  the  same  field, 
they  are  Isomorphic  and  can  be  Identified.  This  Identification  will 
be  denoted  X  Y. 

A.  The  dimension  of  X  will  be  denoted  by  dim  X 
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Logical  Modules 


Defn  1:  A  Logical  Module  L  la  an  ordered  sextuple 
L  -  <C,  I,  10,  M,  0,  F> 

where  C,  I,  10,  M,  0  are  finite  dloenslonal  vector  spaces  over 
finite  fields  and  F  Is  a  function  from  C  x  I  x  10  x  M  Into  0  such 
Chat,  for  each  set  of  vectors  ceC,  lei,  loelO,  meM  there  Is  a  unique 
vector  oeO  for  trtilch 

F(c,  1,  lo,  m)  -  o 

The  notation  used  here  has  the  following  mneumonlc  relations  ; 

C  Is  the  control  apace  of  the  module 
I  Is  Che  Input  space  of  the  module 
10  Is  the  Input /output  space  of  the  module 
M  Is  the  memory  space  of  the  module 
0  Is  the  output  space  of  the  module 
F  Is  the  function  of  the  module 

10  and/or  M  may  be  empty.  The  vectors  of  10  represent  either  input 
or  output  depending  upon  the  control  vector. 

The  outputs  can  be  'swltchable, '  In  idiich  case  the  field  associated 
with  0  has  at  least  three  elements. 

For  simplicity  0  Is  presented  as  a  single  space;  however.  It  some 
but  not  all  outputs  are  swltchable,  0  is  really  a  pair  of  vector 
spaces. 

The  crux  of  the  definition  dictates  that  for  every  coablnation  of 
control.  Input,  10  and  memory  vectors,  there  Is  a  unique  output  vector 
'computed*  by  the  function  F. 

The  concept  of  a  logical  module  applies  over  a  broad  range  of  complexi¬ 
ties,  extending  from  'chips'  to  an  entire  computer  system. 

Defn  2:  Let  L  be  a  logical  module.  The  boundary  of  L,  denoted  by  5L, 
is  the  quadruple 

6L  -  <C,  I,  10,  0> 
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The  quantity 


dim  6L  ■■  dim  C  dim  I  +  dim  10  +  dim  0 
will  be  called  the  dimension  of  the  boundary. 

The  boundary  Is  not  Itself  a  vector  apace,  since  the  underlying  fields 
of  the  constituent  spaces  may  be  different.  However,  the  Individual 
coordinates  of  the  constituent  spaces  can  be  ordered  to  run  from  1  to 
dim  5L  (as  for  x|y).  This  will  be  called  an  ordering  of  the  elements 
of  the  boundary. 

Defn  3:  A  logical  module  L  Is  primitive  If  Its  function  F  Is 
explicitly  given  as  part  of  the  module's  definition. 

Defn  4:  A  logical  module  L  Is  composite  If  It  Is  defined  by  a  com¬ 
bination  of  logical  modules 

^4  I'll  1*2,  ...  ,  !•_ 

1  n 

The  are  called  submodules  of  the  module  L.  A  submodule  can  be 
either  primitive  or  Itself  a  composite  module.  Only  primitive 
modules  can  have  memory. 

In  order  to  combine  logical  modules  and  form  more  comprehensive 
structures  which  are  themselves  logical  modules,  we  must  specify  a 
boundary  correspondence  that  defines  a  boundary  6l  for  a  composite 
module  L  In  terms  of  the  boundaries  6l^  of  the  constituent  subiaodules. 
The  boundary  elements  of  6L  must  be  associated  with  boundary 
elements  of  submodules.  A  single  element  of  5l  may  be  associated 
with  elements  In  more  than  one  submodule  or  even  with  more  than  one 
boundary  element  of  a  single  module.  A  further  restriction  on  the 
boundary  correspondence  Is  that  output  elements  of  the  composite 
module  boundary  be  associated  with  output  elements  of  the  submodules. 
(It  Is  not  required  that,  for  example,  each  Input  to  the  composite 
module  be  an  Input  to  some  subsndule.) 

It  will  usually  be  the  case  that  there  are  some  submodule  boundary 
elements  which  are  not  associated  with  elements  of  the  composite 
boundary.  Mappings  may  be  specified  among  such  elements. 


These  mappings  are  called  Interconnects. 

From  the  definitions  above.  It  seems  clear  that  the  function  of  a 
composite  module  Is  derivable  from  the  structural  relationships  of 
the  boundaries  of  Its  submodules  and  the  submodule  functions.  In 
fact,  the  boundary  correspondences  and  Interconnects  define  the 
function  F  of  the  composite  module  L  implicitly  In  terms  of  the 
functions  F^  of  the  submodules  L^. 

To  aid  us  In  giving  some  specific  examples  of  combinations,  we  Intro¬ 
duce  the  following  notions. 

Defn  5:  Two  logical  modules  are  equivalent  If  there  exists  a  one-one 
correspondence  of  their  boundaries  which  results  In  their  functions 
becoming  Identical. 

Defn  6:  A  composite  logical  module  Is  homogeneous  If  all  of  Its  sub- 
modules  are  equivalent. 


Simple  Composite  Modules 

As  a  rule  It  Is  difficult  to  deslcribe  the  functional  relationships 
between  the  function  F  of  the  composite  module  L  and  the  functions  F^ 
of  the  submodules.  In  such  a  case,  the  boundary  correspondences  and 
Interconnects  can  be  listed  element  by  element.  However,  when  the 
boundary  mappings  can  be  described  in  vector  terms,  F  often  has  a 
simple  representation  In  terms  of  the  F^.  We  detail  a  few  such  situa¬ 
tions  below. 

Parallel  Modules 

In  the  design  of  a  computer  system,  we  frequently  encounter 
logical  modules  that  occur  In  parallel  configuration;  that  Is,  their 
Inputs,  outputs,  memories  and  controls  are  respectively  Independent 
and  consequently  their  functions  can  be  simultaneously  used. 

For  example,  consider  an  'or'  chip  with  4  'or'  gates  which  can  be 
used  Independently.  In  this  case  the  individual  gates,  not  the  chip, 
are  the  primitives;  the  chip  represents  a  composite  module  with 
parallel  submodules. 


r-  y 

1 

I  Let 


Li 

*  “^Cj,  Ii, 

lOi, 

,  Ml, 

Oi, 

F I >  and 

L2 

■  <C2,  l2t 

IO2, 

,  M2, 

O2, 

F2> 

be  logical 

modules ;  let 

C 

-  Cl  +C2 

I  - 

ll  + 

I2 

10  »  lOi  + 

M 

-  Ml  +  M2 

0  - 

0i  + 

O2 

and  define 

F(c,  i,  lo. 

m)  -  Fi(ci 

.  ll, 

.  lOl: 

,  «1 

)|F2(C2,  I2* 

for  each  con^lnatlon  of  control  input  10  and  memory  vectors. 

Then  L  ■  <C,  1,  10,  M,  0,  F>  is  a  logical  module  with  parallel 
submodules.  The  boundary  of  L  is  <Ci  +  C2,  Ij  +  I2 ,  lOi  +  102,  0i  +  02>. 
dim  6L  -  dim  6L1  +  dim  6L2 .  There  are  no  Interconnects. 

Widened  Modules 

Using  homogeneous  submodules  it  is  easy  to  define  a  wider  module 
with  the  same  controls.  Since  homogeneous  modules  are  logically 
equivalent,  we  may  use  the  ordering  assigned  by  the  equivalence  and 
identify  corresponding  controls.  This,  in  effect,  gangs  the  controls 
of  the  submodules  together.  The  inputs,  outputs  and  memories  are 


combined 

in  parallel  as 

above . 

Let 

C  “  Cl  C2 

I  -  ll  +  I2 

10  -  lOi  +  IO2 

M  -  Ml  +  M2 

0  -  0i  +  O2 

and  define 

F(c, 

i,  lo,  m)  -  F 

(c,  ll,  lOl, 

■i)|  F2(c,  I2, 

'  I 


I 

I 

I 


for  each  combination  of  control,  input,  10  and  memory  vectors. 
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dim  6L  =  dim  Cj  +  2  dim  Ij  +  2  dim  lOi  +  2  dim  Oj  <  dim  6Li  +  dim  6L2 
There  are  no  Interconnects. 

An  example  of  the  realization  of  a  widened  module  is  a  12-blt  register 
made  from  three  4-blt  register  chips. 

Composition  of  Modules 

Logical  modules  may  be  combined  by  composition  in  two  Important  ways, 
viz  the  output  of  one  may  provide  inputs  or  controls  for  another. 

In  either  case,  the  composition  mapping  represents  Interconnects. 

Function  Composition 

If  the  dimension  of  the  output  space  of  the  first  module  agrees 
with  the  dimension  of  the  Input  space  of  the  second  module,  then  the 
composition  of  these  Is  a  logical  module.  Let 

C  «  Cl  +  C2  I  =•  Ii  10  -  lOi  I-  IO2 


M  «  Ml  +  M2  0  »  O2  and  define 
F(c,  1,  io,  m)  -  F2(c2,  Fi(ci,  ii ,  ioi ,  mi),  I02 ,  m2) 

for  each  combination  of  control,  input,  10,  and  memory  vectors. 

Such  modules  are.  In  effect,  cascaded  and.  In  the  case  that  Li  can 
be  preparing  Its  new  outputs  without  modifying  its  prior  outputs, 
this  composition  Is  said  to  form  a  pipeline. 

Control  Composition 

If  the  output  space  of  one  module  coincides  with  the  control 
space  of  another  module,  the  composition  of  these  modules  Is  a 
logical  module. 

Let  dim  0i  dim  €2, 

C  -  Cj  I  -  Ii  +  l2  10  -  lOi  +  IO2 
M  ■  Ml  +  M2  0  ■  O2  and  define 
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F(c,  i,  lo,  m)  ■  F2(Fi(ci,  ij,  loi,  mi),  I2,  I02,  m2) 

for  each  combination  of  control.  Input,  10  and  memory  vectors. 

In  such  a  case  as  this  we  say  the  module  Li  has  provided  the  control 
for  module  L2 .  Such  a  module  Is  called  a  control  module . 

In  the  various  composite  modules  treated  above,  we  have  limited  our¬ 
selves  to  simple  combinations,  yet  combinations  of  these  combinations 
will  provide  most  of  what  Is  required  to  treat  significant  computer 
systems . 
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D.  Defining  Logical  Modules 

The  definitions  of  the  previous  sections  were  of  conceptual  Intent;  that 
Is,  they  essentially  assigned  formal  names  to  Intuitive  concepts.  In  order 
to  translate  these  concepts  Into  somethings  a  computer  can  get  Its  teeth  into, 
we  must  present  a  module  definition  language  and  a  corresponding  data  struc¬ 
ture.  Since  module  combinations  can  be  completely  described  In  terms  of  the 
module  boundary  and  the  boundaries  of  the  submodules,  the  language  Is  a 
vehicle  for  specifications  of  boundaries. 

Three  kinds  of  documents  suffice  for  logical  module  definitions: 

1)  definition  document  for  a  primitive  module 

2)  definitive  document  for  a  composite  module 

3)  definition  document  for  a  module  which  Is  a  copy  of  a  previously 
defined  module  (either  primitive  or  composite) . 

Definition  document  for  a  primitive  module 

The  boundary  elements  of  the  module  must  be  ordered  and  assigned  'local' 
names.  For  example.  If  the  module  Is  to  be  realized  as  a  'chip,'  the  names 
given  to  the  pins  by  the  manufacturer  will  suffice.  The  document  then  con¬ 
sists  of  a  list  of  those  names,  together  with  a  description  of  the  use  made 
of  each  boundary  element  (control.  Input,  10,  output).  The  amount  of  memory 
required  (If  any)  must  be  detailed. 

In  addition,  pointers  must  be  provided  to  subroutines  which  compute  the 
module's  function.  Up  to  three  such  subroutines  are  allowed.  This  allows 
control.  Input,  10  and  output  vectors  to  be  divided  Into  convenient  subsets, 
and  enables  subsets  of  outputs  to  be  computed  from  subsets  of  Inputs. 

Definition  document  for  a  composite  module 

The  document  contains  a  list  of  submodules  of  the  module.  Then,  for 
each  element  of  the  composite  module  boundary,  two  items  of  Information  must 
be  given:  the  use  of  this  element  (input,  control,  etc.)  and  the  signal 
name  associated  with  that  clesent.  Each  signal  name  must  appear  in  the 
definition  docuamnt  of  at  least  one  submodule. 

The  signal  names  In  the  document  definitions  for  the  module  and  Its  sub- 
swdules  determine  the  boundary  correspondences  and  the  Interconnects 
between  the  submodules.  The  major  design  step,  then,  is  encompassed  In  the 
assignment  of  signal  names. 


26 


Definition  document  for  a  copy  of  a  previously  defined  module 

In  such  a  document,  the  name  given  in  the  original  definition  document 
must  he  provided.  Then  the  boundary  elements  of  the  module  are  assigned  new 
signal  names.  The  elements  are  listed  in  the  same  order  as  in  the  original 
module  definition  document.  For  each  boundary  element,  its  type  (control, 
input,  etc.)  as  well  as  its  assigned  (new)  signal  name  is  designated. 

Using  such  documents  allows  many  copies  to  be  made  from  one  definition 
document.  In  the  case  of  primitives,  this  avoids  proliferation  of  copies  of 
the  subroutine  for  the  module.  In  the  case  of  composite  modules,  it 
enhances  modularity  by  allowing  one  set  of  signal  names  to  be  associated  with 
boundary  elements  within  a  module,  and  a  different  set  to  be  assigned  to  those 
same  elements  when  the  module  is  used  as  a  component  of  a  larger  module. 

If  the  definition  document  for  a  submodule  Li  of  a  module  L  is  Itself  a 
composite  module,  definition  document,  then  the  signal  names  connecting 
boundary  elements  of  L  with  boundary  elements  of  Li  will  continue  into  the 
interior  of  Li .  Such  a  signal  connects  at  least  three  levels  of  the  structure. 

This  runs  counter  to  modularity  but  is  allowed  in  our  system  for  flexibility. 

When  the  design  documents  for  all  submodules  of  a  module  are  copies  of 
previously  defined  modules,  all  signals  among  the  boundary  elements  of  the 
module  and  the  submodules  stop  at  the  boundaries  of  the  submodules,  preserving 
modularity.  That  Is,  wiring  between  modules  Is  Independent  of  wiring  within 
those  modules. 

Examples  of  definition  documents 

We  Illustrate  the  concepts  of  the  previous  sections  by  Including  printouts 
of  design  documents. 

Document  1  Is  a  composite  module,  definition  document  for  an  8-blt  counter,  CNTR8. 
It  Is  composed  of  the  submodules  CNTl,  CMT2,  CNTB,  and  has  24  boundary  elements. 
Signal  names  on  the  same  line  represent  elements  of  the  same  type  —  control. 

Input,  etc.  The  signal  names  listed  here  appear  In  the  design  definition 
documents  for  the  modules  CNTl,  CNT2  and  CNT8. 

Document  2  defines  a  module  ADDR8  which  Is  a  copy  of  the  CNTR8  module. 

ADDR8  Is  coiid>lned  with  another  module  to  form  a  module  PAD.  The  definition 
document  for  PAD  Is  shown  in  document  3. 


Those  signal  names  In  docusient  2  which  also  appear  in  document  3,  (e.g. 
BUS  7)  detail  part  of  the  boundary  correspondence  for  the  composite  Ba>dule 
PAD.  Names  which  appear  in  document  2  but  not  in  document  3  (e.g.  A7)  will 
occur  in  the  definition  document  for  the  other  submodule  of  PAD  (RAM12). 
These  represent  Interconnects  betwen  the  submodules  of  PAD. 
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DOCUMENT  CNTRe  PAGE  1  LINE  1 

•CNTR0  0  bit  counter  module  with  bussed  end  regular 
outputs 

S'iBfWDS  CNT1,CNT2,CNTB 

C0NTR0L  LDn, INC,BA,BAn,CLK,Gln.G2n 

10  BUS  7 , BUSG , BUSS - BUs4 , 0US3 , BUS2 . BUS 1 .  BUSO 

0UTPUT  CNTr  -  CNTG  ,  CNT5  ,  CNT"f  ,  CNT3  ,  CNT  2  ,  CNT 1 ,  ONTO 

0UTPUT  CNT2R(fln 

END 


Docunent  1.  Definition  docuaent  for  a  composite  module 
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CMCUnENT 

*A00P8 

TYPE 

C0NTR0L 

BUS 

0UTPUT 

0UTPUT 


/ 


ADDR8  PAGE  1  LINE  1 

0  b ; t  ADDRESS  REGISTER/CNTR 
CNTR0 

LDn , I NO , BA , BAn - CLK , DC , GND 

BUSr , BUS6 , BUS5 , BUS^ , BUS3 , BUS2 , BUS I ,  BUSO 

Ar,A6,A5,A'#,A3,A2,Al,A0 

OPEN 

END 


Docunent  2.  Definition  docuaent  for  a  'copy'  of  the 
■odule  definition  docuaent  1 
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DOCUMENT 
'PAD 
counter 
SUBMODS 
INPUT 
INPUT 
C  CiNTRCL 
b  .‘S 

k^UTPUT 

w‘'.JTPUT 

FND 


pad  page 

256  word  by  12  bit 


1  LINE  1 

RAM  with  address  register 


:^DRB,RAM12 

mil , INIO, IN09, TN08, INOr, rN06 
'105,  IMO^,  IN03,  IN02,  INOl  ,  INOO 
■' ,  LDn ,  DC ,  I NC  ,  BA ,  BAn  ,  CLK 
'S  /' ,  B- .IS6  ,  BUSS ,  BUS^t ,  BUS3 ,  RUS2  , 

IT  1  1  iTI  IT1  n  rtll  iTi'.o  /HI  i-r-N.-.  .•-•x 


Document  3.  Definition  document  for  a  composite  module  on 
the  next  higher  level.  The  module  defined  in 
document  2  is  a  submodule  of  this  module. 
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E.  The  Integrated  Logical  Slwilator 


The  logical  module  simulator  permits  verification  of  the  logical  design 
of  a  module  prior  to  Its  construction.  It  also  can  be  used  to  compute 
expected  results  for  comparison  with  values  measured  using  the  module  analyzer. 
In  addition,  algorithm  development  for  a  device  can  be  carried  out  using  the 
simulator. 

As  part  of  an  Integrated  logical  design  facility,  the  simulator  uses  the 
common  logical  module  structure  for  description  of  the  module  being  simulated. 
The  module  Is  defined  In  terms  of  Its  boundary  elements  and  Its  constituent 
submodules.  These  submodules  may  themselves  be  composite  modules,  or  they 
may  be  uses  of  previously  defined  modules.  At  the  lowest  level,  these  are 
primitive  modules  with  associated  programs  to  perform  their  function.  It  Is 
also  possible  to  replace  a  module  In  the  simulation  with  the  physical  device 
using  the  module  analyzer.  In  this  mode,  signals  to  and  from  the  module 
analyzer  are  used  with  those  produced  through  simulation  of  the  remaining 
modules. 

The  simulation  Is  performed  at  discrete  time  steps.  For  each  step,  the 
processing  consists  of  computing  the  outputs  of  the  module  from  given  Inputs 
and  controls.  All  output  computation  actually  takes  place  at  the  lowest,  or 
primitive  level.  Input  and  control  signals  are  distributed  from  higher  level 
modules  down  through  submodules  until  primitive  modules  are  reached.  Signals 
are  collected  at  the  boundary  of  each  primitive  module  until  those  needed  to 
compute  outputs  are  available;  then  outputs  are  computed.  These  outputs  are 
then  distributed  throughout  the  module  structure  wherever  they  are  needed  as 
Inputs  or  control  signals.  The  processing  for  a  step  Is  complete  when  all 
primitive  modules  have  computed  their  outputs. 

A  logical  module  Is  described  to  the  simulator  by  specifying  Its  boundary 
elements  and  either  Identifying  the  module  type  from  a  library  of  previously 
defined  modules  or  by  listing  the  constituent  submodules.  The  submodule 
descriptions  are  in  the  same  form:  either  uses  of  previously  defined  modules  or 
sets  of  Interconnected  smdules.  The  boundary  specification  for  each  module 
Is  ordered  by  position.  The  use  of  the  boundary  element  (Input,  Output, 

Control,  Bus)  Is  given  explicitly.  The  signal  name  given  for  the  element 
defines  the  Interconnection  structure  between  submodules  and  to  the  boundary 
of  the  composite  module. 


Primitive  modules  are  defined  for  simulation  by  providing  an  executable 
subroutine  which  computes  the  outputs  of  the  module  from  its  inputs  and 
controls.  In  addition,  a  table  must  be  provided  which  describes  the  use 
made  of  each  boundary  element  of  the  module,  together  with  the  amount  of 
memory  which  the  module  contains.  Each  such  module  can  have  up  to  three 
separate  subroutines.  This  permits  computation  of  some  outputs  based  on  the 
presence  of  only  a  partial  subset  of  the  inputs.  This  simplifies  the  pro¬ 
cessing  of  modules  containing  memory  elements  whose  state  is  to  be  used 
during  the  current  step  and  updated  for  use  in  the  following  step.  One  sub¬ 
routine  makes  their  outputs  available  as  soon  as  the  appropriate  control 
signals  are  present,  another  updates  their  internal  state  when  the  input  data 
is  available. 

A  two-part,  parallel  data  structure  is  used  for  the  logic  simulation. 

The  first  part,  the  interconnect  table  (Table  3.1),  represents  the  logical 
interconnections  of  the  boundary  of  a  module  with  the  boundaries  of  its 
constituent  submodules.  This  structure  provides  the  paths  for  transfer  of 
logic  values  from  their  source  to  d.1  modules  where  they  are  used.  Each  module 
entry  in  this  structure  has  a  set  of  pointers  to  the  entries  for  each  of  its 
submodules  and  a  pointer  entry  for  each  boundary  element  of  the  module  and  of 
each  of  its  submodules.  These  pointer  entries  are  linked  together  in  circu¬ 
lar  lists  connecting  all  boundary  points  which  share  a  comann  signal  name. 
Primitive  module  entries  in  this  structure  contain  the  subroutines  for 
computing  their  outputs.  If  the  same  module  is  used  more  than  once,  only 
one  copy  of  it  is  needed  in  this  structure  since  its  connections  to  other 
modules  for  each  use  are  described  at  the  next  higher  level. 

The  second  structure  used  for  simulation  is  a  state  table  which  maintains 
the  record  of  the  current  state  of  the  module  (Table  3.2).  This  table 
includes  the  memory  for  each  primitive  module  and  the  current  state  of  all 
its  boundary  elements.  At  higher  levels,  this  table  contains  only  a  list  of 
pointers  to  the  entries  for  each  submodule.  Since  identical  modules  need 
not  have  identical  states,  this  table  has  separate  entries  for  each  use  of 
a  module. 

These  data  structures  are  created  from  the  text  description  of  a  module 
in  a  two-stage  process.  Each  composite  module  is  first  processed  by  the 
logic  simulation  assembler  to  produce  its  interconnect  table,  which  shows  all 
logic  variables  present  on  its  boundary  as  well  as  all  interconnections 
between  the  submodules  which  make  up  this  module.  The  output  of  the  logic 
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assenbler  for  a  module  Is  this  Interconnect  table  and  a  list  of  the  sub- 
modules  referenced.  When  a  module  Is  to  be  simulated,  the  Interconnect 
tables  for  all  modules  contained  In  It  are  loaded  along  with  primitive  module 
code  and  linked  together.  At  the  same  time,  the  state  table  Is  constructed 
and  the  Initial  variable  state  filled  In.  This  completes  the  second  stage  of 
the  creation  of  the  data  structures  needed  and  Is  followed  by  the  actual 
simulation  steps. 
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Table  3.1  Interconnect  Table  Formats  for  a  Module 


a.  Nonprlmltlve  Module 


Word  # 


Contents 


0  #  submodules  +  1  (Hl-1) 

1-M  Pointer  to  each  submodule  entry  (*-dlsplacements) 

M4-1  #  boundary  elements  for  module  (N) 

M+2  -  M4-N+2  Pointer  entry  for  each  boundary  element 

HfN+3  -  END  Entries  for  submodules 

A  (Interconnect  table  for  submodule)  filled  In  during 
second  phase. 

One-word  pointer  entry  for  each  boundary  element  of  sub- 
module,  contains  submodule  Index/pln  Index  of  next  use  of 
signal.  Each  element  entry,  whether  on  the  boundary  of 
the  module  or  some  submodule.  Is  a  circular  list  link  of 
the  form:  submodule  Index/element  Index  where  submodule 
Index  0  refers  to  the  boundary  of  the  module.  All 
elements  with  the  same  signal  name  are  linked  together 
In  a  ring. 

b.  Primitive  Module 

Word  0:  length  of  state  table  entry  for  module 

Word  1:  A  (source  for  state  table  entry) 

Word  2:  A  (primary  processing  code) 

Word  3:  control  entry  1  A(flrst  control  processing  code) 

Word  4:  control  entry  2  A(second  control  processing  code) 

A  primitive  module  has  up  to  three  processing  sections. 
Each  section  has  Its  own  entry  point  and  Is  executed 
according  to  the  following  rules: 

1.  Entry  point  given  at  module  base  -1-2.  Standard  processing. 
Executed  when  the  specified  number  of  scheduling  Inputs  and 
controls  are  valid. 

2.  Entry  at  module  base  +  3:  Executed  when  control  with  subtype 
1  Is  valid. 

3.  Entry  at  module  base  -f  4:  Executed  when  a  control  with  sub- 
type  2  Is  valid  and  low  or  a  control  with  subtype  3  Is  valid 
and  high. 
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Table  3.2  State  Table  Foraats 


a.  State  Table  Nonprlmltlve  Module  Entry 


Words  1  to  N-1:  Address  of  state  table  entry  for  each  submodule. 
Word  N:  #1  address  of  state  table  entry  for  last  submodule. 


b.  State  Table  Primitive  Module  Entry 
Word  0:  #2 


IJ 

1  needed 

#  received 

15  14 


8  7  6 


R  and  S  are  set  when  control  entry  1  or  2  respectively  can  be 
executed. 

#  received  is  a  count  of  the  number  of  signals  which  have  their 
schedule  bit  set  which  have  been  received. 

#  needed  is  the  number  of  these  signals  which  must  be  present 
before  the  primary  entry  can  be  executed. 


Words  1  to 


N 

2* 


[ni 


ST 


sl  V  In  ic  1  T  1  ST  Is  |v  In  |c 


15  14  13  12  11  10  9  8  7  6  5  4  3  2  10 


One  byte  for  each  signal: 

T  *  Type  00  ■  Control 
01  *  Output 

10  ••  Input 

11  s  dus 

ST  ■  Subtype  Output  Bus 

00  »  Comblnatorlal/Ungated 

10  *  Latched/Ungated 

01  »  Combinatorial/enabled 

11  -  Latched/enabled 


Control 
00  ■  Other 

01  *  lomedlate  entry  1 

10  ■  Active  low  -  entry  2 

11  •  Active  high  -  entry  2 


S  ■  Schedule  bit  -  Increment  received  count  when  this  signal  is  received 
V  ~  Valid  -  set  when  signal  is  valid 

N  -  Next  Value  -  retains  value  for  use  in  clocked  outputs 
C  >  Current  Value  -  state  of  signal  if  V  bit  is  set. 

The  last  signal  word  has  its  flag  1  set.  If  the  module  requires 
memory,  it  will  be  located  inaedlately  after  the  signal  entries. 
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The  Integrated  Module  Analyzer 


The  Module  Analyzer  (MA)  performs  post  construction  verification  of  the 
logical  and  engineering  design-  At  this  stage  In  the  design  the  modules 
Involved  should  be  logically  correct.  However,  engineering  considerations  may 
affect  the  performance  and  Implementation  of  the  logical  design.  Timing 
characteristics,  signal  transmission  paths,  power  requiresients,  and  noise 
environment  are  among  the  possible  engineering  design  considerations  that 
may  affect  the  final  form  of  the  hardware. 

The  realization  of  the  prototype  MA  was  Influenced  by  two  factors: 

(1)  the  desire  to  have  Its  elements  under  direct  external  software  control, 
and  (2)  to  have  a  working  prototype  quickly  available.  These  two  factors 
resulted  In  a  device  that  has  overall  structural  simplicity.  The  final 
Implementation  of  the  MA  may  Involve  more  sophistication  that  could  allow 
more  automation  of  hardware  checkout.  For  example,  the  manual  switches  used 
for  sampling  selection  could  be  electronic  switches  under  program  control. 
Nevertheless  the  prototype  MA  has  proven  Itself  a  useful  tool.  The  remainder 
of  this  section  documents  the  MA  and  contains  the  following  paragraphs: 
Paragraph  1  Is  a  functional  description  of  the  entire  unit  Including  each  of 
Its  major  registers.  Paragraph  2  describes  how  to  operate  the  MA  using  the 
manual  controls,  external  connections,  and  programmatic  capabilities. 

Para.  3  Is  a  detailed  circuit  description.  Appendices  Include  sample 
microprograms,  and  a  complete  set  of  drawings. 

1.  Functional  Description 

The  main  features  of  the  KA  are  depicted  In  the  block  diagram  of 
Figure  3.1.  Generally,  the  purpose  of  the  MA  Is  to  provide  both  static  and 
dynamic  inputs  to  a  board  under  test  and  then  to  sample  the  board's  outputs 
after  a  specified  time  period.  This  time  period  Is  specified  by  external 
control,  so  that  a  sequence  of  tests  can  determine  when  an  output  changes 
within  5  ns.  External  addresses  and  data  are  supplied  on  PAXOO*  to  PAX05* 
and  10X00  to  10X15,  respectively.  The  Internal  registers  are  addressed 
according  to  the  decoding  scheme  Included  in  Figure  3.1. 

The  Buffer  Register  (BR)  has  several  purposes.  It  serves  as  a  con¬ 
verter  from  (or  to)  external  16-blt  data  to  (or  from)  Internal  140-bit  data. 
This  Is  accomplished  by  nine  separately  addressable  16-blt  registers  within 
BR.  After  data  has  been  transferred  Into  BR,  the  Information  may  then  be 
loaded  Into  either  the  Input  Register  or  the  Control  Register,  both  140  bits. 
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BR  also  serves  as  the  data  storage  register  when  the  board  under  test  is  sampled. 

The  Input  Register  (IPR)  provides  static  Input  data  to  the  board  under 
test.  It  Is  a  140-blt  register  that  is  loaded  from  IR.  Its  outputs  may  be 
connected  to  the  board  under  test  by  switch  settings. 

The  Control  Register  (C#N)  provides  dynamic  input  data  to  the  board 
under  test.  The  clocking  of  O0M  provides  an  Initiate  pulse  to  the  MA  that 
begins  the  test  sequence.  This  Initiate  pulse  defines  the  time  of  change  of 
any  bit  in  C0N  and  also  provides  a  start  pulse  to  the  counter. 

The  Counter  consists  of  two  elements,  the  Count  Save  (CS)  register  and  a 
SO  Mhz  up-counting  register  (CRT).  The  counter  has  two  functions:  It 
generates  timing  strobes  for  the  various  data  transfer  operations  to,  from, 
and  within  the  HA.  It  also  provides  the  sample  strobe  that  defines  when 
data  from  the  board  under  test  is  loaded  into  BR  and  the  Status  Register. 

The  length  of  time  between  the  initiate  pulse  of  0#M  and  the  sample  pulse  of 
CNT  is  determined  by  the  nuaber  loaded  into  CS.  This  time  may  be  practically 
varied  from  30  ns  to  327  pa  in  5  ns  intervals. 

The  Status  Register  (STAT)  la  a  16-bit  register  whose  inputs  are  directly 
attached  to  external  probes.  These  probes  may  be  connected  to  any  signal 
on  the  board  under  test;  this  provides  the  capability  to  sample  16  internal 
states  in  addition  to  the  I/O  pins.  Data  from  STAT  is  directly  read  from  the 
X-Bua  since  it  is  only  16  bits  wide. 

Switches  on  the  MA  are  set  to  correspond  to  the  function  of  each  pin. 

Input  pins  are  switched  to  "I"  or  "C"  depending  on  whether  the  input  is  a 
static  or  dynamic  one.  At  sample  time  the  state  of  these  inputs  is  loaded 
into  BR.  Output  pins  are  switched  to  *'#"  so  that  at  sample  time  the  state  of 
the  output  pin  is  recorded.-  Power  and  ground  pins  are  switched  to  "6";  power 
wiring  is  not  a  logic  input  so  it  is  wired  separately  to  the  board  connector. 

2.  Operation 

The  operation  of  the  MA  Is  govsmed  by  three  conditions:  (1)  External 
connections,  (2)  switch  settings,  and  (3)  control  of  external  logic  signals. 

The  external  connections  must  be  made  prior  to  powering  on.  The  power 
cable  must  be  plugged  to  both  the  MA  and  an  external  5V  power  supply.  A 
50-pin  flat  cable  must  be  connected  to  Connector  MPl.  This  provides  external 
input /output;  pin  assignments  are  given  in  Table  3.3.  The  opposite  end  is 
connected  to  the  device  providing  control.  The  board  to  be  tested  should  be 
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Table  3.3 

MPl  Pin  Aanlgnaenta 

1  lia 

Signal 

Pin 

Signal 

1 

1 

I#X00* 

26 

10X12*R 

1  ' 

I#X0O*R 

27 

10X13* 

*  3 

1#X01* 

28 

10X13*R 

1  * 

itfxom 

29 

10X14* 

1  5 

10X02* 

30 

I0X14*R 

I0XO2*R 

31 

10X15* 

j  7 

10X03* 

32 

10X15*R 

8 

10XO3*R 

33 

10XBilA* 

i  9 

10X04* 

34 

10XnA*R 

10 

I0XO4*R 

35 

UNUSED 

\ 

10X05* 

36 

UNUSED 

12 

I0XO5*R 

37 

PAXOO* 

i 

10X06* 

38 

PAXOO*R 

*  14 

10XO6*R 

39 

PAXOl* 

10X07* 

40 

PAX01*R 

1  16 

10XO7*R 

41 

PAX02*R 

17 

10X08* 

42 

PAX02*R 

1 

10XO8*R 

43 

PAX03* 

19 

10X09* 

44 

PAX03*R 

1  20 

10X09 *R 

45 

PAX04* 

21 

10X10* 

46 

PAX04*R 

1  22 

10X1O*R 

47 

PAX05* 

23 

10X11* 

48 

PAX05*R 

1 

10X11*R 

49 

UNUSED 

1  25 

10X12* 

50 

UNUSED 

I 

I 

[ 
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placed  in  the  edge  connector  provided  on  the  top  of  the  MA.  When  these  connec¬ 
tions  have  been  made  the  power-on  switch  way  be  enabled. 

Before  programmatic  manipulation  is  begun,  the  desired  switch  settings 
should  be  made.  Output  pins  of  the  board  under  test  should  have  their  corres¬ 
ponding  switches  set  to  "0".  Inputs  should  be  set  to  "I"  or  "C"  depending  on 
whether  the  pin  will  have  a  static  or  dynnic  input.  In  addition  to  switch 
settings  the  user  may  desire  to  attach  any  or  all  of  the  16  probes  to  internal 
points  on  the  board. 

Programmatic  control  of  the  MA  is  achieved  by  manipulation  of  the  address 
(PAXOO*  to  PAXIS*),  data  (10X00  to  10X15).  and  enable  (I0XENA*)  signals. 
Generally  the  procedure  to  perform  a  test  sequence  is  as  follows: 


(1) 

Reset  the  MA  (PA  34|). 

(2) 

Load  CS  with  the  desired  16-bit  count 

(PA  -  31|). 

(3) 

Load  BR  with  the  desired  data  for  IPR 

(PA  20  to  30). 

(4) 

Transfer  BR  to  IPR  (PA32). 

(5) 

Load  BR  with  the  desired  data  for  C0N 

(PA  20  to  30). 

(6) 

Transfer  BR  to  O0M  (PA33) ;  this  initiates  the  count  previously  loaded 
into  CS. 

(7) 

Walt  for  sample  pulse. 

(8) 

Read  BR  (PA  40  to  SO). 

(9) 

Read  STAT  (PA  51). 

These  data  transfers  are  controlled  by  internally  generated  timing  strobes. 
These  strobes  define  the  times  during  which  comsMnds  and  data  are  to  be  supplied 
to  the  MA,  and  they  define  when  NA  output  data  is  be  externally  sampled.  The 
timing  of  input  to  the  MA  is  given  in  Figure  3.2.  goth  I0X  (data)  and  address 
(PA*)  should  be  changed  a  adnimum  of  100  ns  before  making  I0XENA*  active. 

I0XENA*  enables  the  X-Bus,  and  820  ns  later  the  timing  pulse,  STB0BE1*,  is 
activated.  In  Figure  3.2  the  signal  BEG  DECODE  represents  the  register  enable 
signal  that  is  a  decode  of  FA  20-34.  The  coi0>ination  of  decode  and  STI(0BE1* 
generates  the  appropriate  register  clock.  This  latches  the  data  in  the  corres¬ 
ponding  register.  The  timing  is  the  same  for  clocking  external  data  into  a 
16-bit  segment  of  BR,  trsnsfarring  140  bits  from  BR  to  IFR  or  O0M,  and 
generating  a  reset  pulse.  The  user  must  simply  keep  I0XIMA*  active  for  more 
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I0XENA* 


I 

X 

STOBEl* 

REGDECODE 

REGCLK 


I0X 
PA* 

I0XENA* 

X 

STUaBEl*  ■ 
STR(fBE2*  ‘ 

STROBE3*  * 


BEGDECODEQ. 


than  900  ns  and  then  data,  address  and  I0XEMA*  say  be  aade  inactive 
s iBu 1 t aneous ly . 

The  tlalng  for  output  froa  the  MA  Is  siallar  to  the  Input  tiaing  with  the 
addition  that  the  output  data  auat  be  saapled  during  a  specified  tlae  period. 
Output  tlalng  Is  given  in  Figure  3.3.  The  tiaiag  for  output  is  the  saae  as 
input  thru  the  generation  of  STMBEI*.  This  allows  the  KA  to  decode  the  add¬ 
ress  of  the  register  being  selected  for  output.  However,  after  STR0BE1*  there 
are  additional  tiaing  rules.  The  user  aust  stop  driving  the  enable,  address, 
and  data  signals  before  STlt#BE2*  which  occurs  at  1140  ns.  At  this  tlae  the  MA 
will  begin  driving  the  I#X  lines  and  will  continue  until  STB#BE3*  at  1920  ns. 
Data  aust  be  saapled  between  1140  and  1920  ns. 

Appendix  1  gives  saaple  alcro  progrsas  for  controlling  the  MA.  They  are 
written  for  a  CHI  MP32A  Macro  Frocesaor.  The  Instruction  0IJTPUT  CLB  provides 
proper  tiaing  to  I0XEMA^  and  IMPUT  saaplcs  10X  during  the  tiae  it  is  driven  by 
the  MA.  These  aicro  routines  assuae  that  FD  (which  drives  I#X)  has  been  pre¬ 
viously  set  up  with  the  desired  data. 

3,  Circuit  Description 

This  section  will  first  describe  each  of  the  major  registers  within  the  MA 
and  then  describe  the  control  circuits  that  generate  the  proper  tiaing  and 
event  sequence.  Refer  to  the  logic  drawings  in  Appendix  2. 

The  Buffer  Register  (BR)  Is  the  only  relgster  that  functions  as  both  an 
array  of  16-blt  registers  and  as  a  140-bit  register.  BR  is  aade  of  DII8542 
4-blt  three-state  registers.  External  Input  data  (10X00  to  10X15)  is  received 
by  8T9S  three-state  drivers  whose  outputs  arc  connected  to  the  A-Bus  of  the 
8542.  The  140  bits  arc  segasnted  into  14-bit  groups,  each  group  having  a 
separate  input  and  separate  output  enable  controls.  These  controls  are  direct 
decodes  of  the  FA*  signals.  Although  the  entire  140-bit  register  aust  have  a 
coamon  clock  to  allow  loading  of  140  data  bits  froa  the  B-Bus,  sixteen  bit 
loading  is  accomplished  by  the  A-Bus  controls.  Thus  BR  is  externally  read  in 
16-bit  segments. 

When  data  is  to  be  transferred  from  BR  to  either  the  Input  or  Control 
Registers  the  B-Bus  output  enable  control  is  activated.  Data  Is  clocked  Into 
these  registers  at  STR0BE1*  time.  Both  the  Input  and  Control  Registers  (IFR 
and  C0M)  are  aade  of  SN74L8174  6-blt  registers. 
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The  outputs  of  IPR  end  0#M  asy  be  ewitchcd  to  the  board  under  test  and  in 
this  case  they  are  buffered  froa  the  l-Bua  of  the  8542  by  three-state  buffers 
(SN74367).  When  data  Is  to  ba  loaded  into  BR  froa  this  side  the  input  control 
LDBR*  activates  both  the  three-state  buffers  and  the  B-Bus  input  of  BR.  In 
this  case  BRCLK  is  generated  froa  the  counter. 

The  Status  Register  is  aade  of  AM2SS18  4-bit  three-state  registers.  Its 
inputs  are  directly  attached  to  the  external  probes.  It  is  clocked  at  the 
saaple  time  generated  by  the  counter.  Its  three-state  outputs  are  enabled 
onto  the  X-Bus  «dien  its  address  has  been  decoded.  It  is  read  in  the  saae 
Banner  as  any  of  the  16-bit  aegaents  of  BR. 

Tiaing  pulses  and  control  signals  are  generated  by  a  16-blt  up  counter. 

The  basic  clock  of  the  counter  la  supplied  by  a  50  Mis  oscillator.  The  outputs 
of  the  counter  are  used  in  two  functionally  different  ways  and  divide  into 
two  phases.  The  first  phase  generates  all  the  tiaing  and  control  signals; 
the  second  phase  generates  the  data  saaple  pulse  which  asy  be  varied  between 
30  ns  and  327  pa  in  5  ns  intervals. 

The  first  phase  generally  begins  with  the  activation  of  I0XEMA*  and  lasts 
until  STR0BE3*.  The  only  exception  to  this  is  when  the  data  saapllng  pulse 
is  to  be  generated  (see  below) . 

When  I0XQIA*  becoaes  active  it  clcofcs  CNTENAQ  which  enables  the  16-blt 
counter  (AM93S16).  CNTENAQ  is  also  wired  to  the  reset  input  of  each  stage  of 
the  counter  to  ensure  the  count  begins  at  sero.  The  counter  is  a  synchronous 
counter,  but  the  carryout  of  a  stage  is  clocked  into  the  next  stage  with  a 
delay  of  one  clock.  This  was  neceasary  because  the  delay  tiae  froa  clock  to 
carryout  plus  the  setup  tiae  froa  carry  in  to  clock  is  longer  than  the  20  ns 
clock  period  generated  by  the  oecillator.  Thus  it  was  necessary  to  stretch 
the  carryout  by  two  inverters  and  an  0R  gate  in  order  to  aeet  the  setup 
requireaents. 

The  strobe  pulses,  STR0BE1-3*  are  generated  by  siaply  using  an  8- input 
AND  gate  with  the  appropriate  Inputs.  STIOBEI*  generates  all  the  140-bit 
register  clocks:  C0UNTCLK,  IPRCLK,  O0NCLK,  and  BRCLKO.  ST1I0BE2*  clocks 
0UTENAQ,  the  enable  pulse  for  NA  output.  STN0BE3*  is  generally  used  to  reset 
control  flip  flops  to  their  original  stats  so  that  each  tiae  the  NA  is  addressed 
it  begins  operation  froa  the  saae  state.  It  rcseta  fUmiTQ.  017TENAQ,  LDBR 
and  CNTENAQ. 
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The  exception  to  this  sequence  Is  when  the  counter  Is  used  to  generate 
the  ssaple  pulse,  TO.  The  decode  of  ?A33  (Losd  Control  leglster)  generstes 
the  slgnsl  C0NCLK  st  STRdBEl*  tlae.  C0MCLK  slters  the  ususl  sequence  of  events. 
Referring  to  Figure  3.4  the  first  events  ere  the  sctlvstlon  of  STRINHQ*  end 
LDBR*.  STRINHQ*  disables  STROBE2-3*;  LDBR*  enables  the  B-Bus  Input  of  BR. 
Q^NCLK*  Is  used  to  enable  the  parallel  losd  feature  of  the  93S16  counter.  So 
the  contents  of  CS  are  loaded  Into  the  counter  at  every  DELCLK  until  C0NCLK* 
goes  inactive.  At  this  trailing  edge  of  C0NCLK  the  counter  begins  to  count 
and  the  Control  Register  Is  clocked.  Hence  the  dynawlc  Inputs  to  the  board 
under  test  change  at  the  saaw  tine  the  counter  begins  to  count. 

V/hen  the  counter  reaches  a  count  of  all  ones  the  sanple  pulse,  TO,  Is 
generated.  TO  generates  a  BRCLK,  so  the  data  fron  the  board  under  test  (or 
the  contents  of  other  registers  as  detemlned  by  the  switch  settings)  Is  loaded 
Into  BR.  TO  also  clocks  data  Into  the  Status  Register.  Additionally  TO 
resets  STRINHQ*,  so  that  the  counter  now  begins  counting  again  fron  zero. 
Therefore,  the  generation  of  the  STR^BEl-l*  pulses  proceeds  as  previously 
described  with  STR0BE3*  being  the  only  Inportant  event.  STR0BE3*  resets  any 
renalnlng  active  pulses  so  that  the  MA  Is  returned  to  Its  Idle  state. 

The  implementation  of  selectable  5  ns  delays  Is  accomplished  by  using  a 
delay  line  with  2-1/2  ns  delay  taps.  Delays  of  5,  10,  IS,  and  20  ns  are  used 
to  provide  four  selectable  clocks.  DELO  Is  selected  at  all  tines  except  when 
LDBR  Is  active.  When  LDBR  Is  active  the  delay  Is  selected  by  the  two  least 
significant  bits  of  CS. 
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Plgure  3.4  Test  Sequence  Timing 


APPENDIX  a;  sample  OUTPUTS  FROM  THE  STRUCTURED 
DESIGN  SYSTEM 


D*OW*KT  corrv^V 

•conrtWV 

^  ‘Part  rvMM 

1  TS«I 

i  TCLK 

3  V1S«I 

^  *<nciT 

6  01OO 

0  0101 

T  0102 

t  0103 

3  01C-^ 

iO  01C>5 

It  01C4» 

0107 
0108 
0109 
0110 
0111 
aAs«i 

OACI'r 
OCO 
OCl 
CC2 
0B£«  I 
OBCtr 
OCwK 
USel 
nPCLK 

nxcLK 

flVCLK 
npiavo 
<nxs«i 
nxciT 
nvsei 

ntcir 

cu< 

yns«i 


vncir 

0200 

0201 

0202 

0203 

0209 

0205 

0206 

0207 

0200 

0209 

0210 

0211 

HAS«I 

HACIr 

HCO 

HOI 

HC2 

»«S«I 

»«Clr 

>KLK 

VS«I 

NC 


PAOC 


•  r»*  r.*;  w-»., 


1  line  1 


PAOt 


1  LINE 


61 


I 

I 

I 

I 

I 

I 


I 

i 

I 

I 

I 

I 

I 

I 


tx^.'i^vr 

coonAPU 

Cll 

60 

GCIN 

61 

HCHt' 

6Z 

CM' 

63 

OfC< 

65 

65 

1100 

66 

1101 

67 

1102 

66 

1103 

69 

1105 

70 

1105 

71 

1106 

rz 

1107 

73 

IZCv' 

75 

1201 

75 

1202 

76 

I20'3 

77 

1205 

70 

I2ii5 

79 

T206 

90 

4  207 

81 

r2i''8 

6Z 

1209 

83 

1210 

85 

1211 

05 

1300 

86 

1301 

07 

1302 

08 

1303 

09 

1305 

9C 

1305 

91 

1306 

92 

1307 

93 

1308 

95 

1309 

96 

1310 

96 

1311 

97 

no 

90 

ni 

99 

n2 

100 

n3 

101 

n5 

102 

^eo 

103 

105 

HAO 

106 

HAl 

106 

H0PO 

107 

H0P1 

106 

HOPZ 

109 

080 

110 

081 

111 

OAO 

llA 

OAl 

113 

0*P0 

119 

O0P1 

116 

06P2 

116 

00 

117 

VO 

116 

I 


1 

I 


I 


iHlil-KiWM-J  Ilit  mwa 


cwoumnir  connAPO 

NC 


pAoe 


vcc 

vcc 


LINT 


I 

1 

*  <>hr*  ^  t  ly  •  ^  '^'li  1... 


IZi 


/ 


‘■viriMAPU 

•<«TU4PU 

PACE 
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Display 
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Locat Ion 
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JlO 
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J1 
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T1 
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fi 
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-\5 

C»H 
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^>A1 
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fZi 

0.6 

*13 

F-*? 

36,25 

HI 

R19 

0.0 

H5 

R^l 

36,25 
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N21 

0,6 

HA3 

NW3 
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►ei 

T21 

0,6 

^«3 
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36,25 

U1 
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0-6 

U3 

0^3 
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VI 

V21 

0,6 

V3 

V^3 
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xni 

Cl 

0,6 

XP!3 

Cl 
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Rl 

0,6 

VKS 

VI 
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OCM 

A1 
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DC06 
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s 
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1 
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Ul 

0,‘W 

ri 

ra 
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6 
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t 
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• 
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